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We report the measurements of the ti production cross section and of the top quark mass using 
1.02 fb _1 of pp data collected with the CDF II detector at the Fermilab Tevatron. We select 
events with six or more jets on which a number of kinematical requirements are imposed by means 
of a neural network algorithm. At least one of these jets must be identified as initiated by a b- 
quark candidate by the reconstruction of a secondary vertex. The cross section is measured to 
be — 8.3 ± 1.0(stat.)ii'§(syst.) ± 0.5(lumi.) pb, which is consistent with the standard model 
prediction. The top quark mass of 174.0±2.2(stat.) ±4.8(syst.) GeV/c 2 is derived from a likelihood 
fit incorporating reconstructed mass distributions representative of signal and background. 

PACS numbers: 14.65.Ha, 13.85.Ni,13.85.Qk 



I. INTRODUCTION 



The measurement of the top quark properties allows one to verify the consistency of the standard 
model. At the Fermilab Tevatron Collider top quarks are produced mostly in pairs and the mea- 
surement of the ti cross section tests the next-to-leading-order QCD calculations. Moreover, accurate 
measurements of the top quark mass and of the mass of the W boson provide constraints on the mass 
of the hypothetical standard model Higgs boson |l[ . 

At the Tevatron center-of-mass energy, -y/s = 1.96 TeV, the predicted ti production cross section 
is 6.7pb[2| for an assumed top quark mass of 175 GeV/c 2 , with a total theoretical uncertainty of 
approximately 15%, due to the choice of renormalization and factorization scales. The top quark 
decays into a W boson and a b quark almost 100% of the time; the W boson subsequently decays to 
either a quark-antiquark pair or a lepton-neutrino pair. The resulting final states are then usually 
distinguished by the number of charged energetic leptons (e or ji) and the number of jets. 

In this analysis, we examine events characterized by a multijet topology and no energetic lepton 
( "all-hadronic" mode). This ti final state has the advantage of a large branching ratio 4/9) and of 
having no undetected top decay physical observables in the final state. In addition, discrepancies in 
top quark cross section and mass measurements between this and other decay channels could indicate 
contributions from physics beyond the standard model 3]. The major challenge of this channel is 
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the very large QCD multijet background which dominates the signal by three orders of magnitude 
after the application of the online trigger selection. To improve the signal-to-background ratio (S/B), 
requirements based on the kincmatical and topological characteristics of standard model ti events are 
expressed in terms of a neural network and applied to the data. This neural network selection reaches 
a S/B of about 1/12, improving the S/B by 60% with respect to the selection based on kinematical 
cuts used in previous analyses p], Q . The neural network selection is followed by the requirement of 
jets identified as originating from b quarks using a secondary vertex 6-tagging algorithm, achieving a 
S/B of about 1/2. 

The purity of the selected sample enables the clear observation of ti candidates and the measurement 
of the ti production cross section and of the top quark mass. Top quark events result in final states 
with a number of 6-tagged jets larger than in inclusive QCD multijet production, so we use the 
excess of such tags to measure the ti production cross section. A reconstructed top quark mass is 
then determined from a kinematical fit of the six leading jets in the event to a ti final state. The 
distribution of reconstructed top quark masses is then compared to the distributions expected from 
background and ti events simulated at various values of the top quark mass, to obtain the value which 
best describes the data. 

Given the theoretical uncertainties on the production cross section for events generated with N 
partons at tree level, a more accurate background estimate is obtained from the data themselves 
("data-driven") rather than from theoretical prediction of cross section and simulations. 

The CDF and D0 Collaborations previously measured the ti production cross section and the top 
quark mass in the all-hadronic channel Q using datasets with integrated luminosities of approximately 
HOpb" 1 collected at y/s = 1.8TeV during the Tevatron Run I (1992 - 1996). More recent Run 
II measurements of the cross section Q and mass @ have been performed by CDF using 311 pb -1 
collected with the CDF II detector at yfs = 1.96 TeV (March 2002 - August 2004). The results 
reported here are based on the data taken between March 2002 and February 2006, corresponding to 
an integrated luminosity of 1.02 fb _1 . These measurements complement other recent ti cross section 
and top quark mass determinations by CDF @, fl H EH and D0 [H E [l3| in other final states. 

The organization of the paper is as follows: Section|TT] contains a brief description of the CDF II 
detector. The trigger and the neural-network-based sample selection are described in Section HTTl The 
6-tagging algorithm and its efficiency for identifying b jets are described in Section llVl In Section|V]thc 
data-driven method we use for estimating the background from experimental multijet data is applied 
and the related systematic uncertainties are evaluated. Section lVll describes the optimization of the 
kinematical selection and the associated efficiency. The ti production cross section is presented in 
SectionfVlI] The reconstruction of the top quark mass and the corresponding background and signal 
distributions are described in Section lVIIIl the method for fitting these distributions is discussed in 
Section HXl Section [X] summarizes the expected contributions to the systematic uncertainty on the 
top quark mass measurement, while Section IXII describes the top quark mass measurement. Finally, 
cross section and mass measurements are summarized in Section R!!! 

II. THE CDF II DETECTOR 

The CDF II detector [l4| is an azimuthally and forward-backward symmetric apparatus designed to 
study pp collisions at the Tevatron. A cylindrical coordinate system as described in [l5| is used. The 
detector consists of a magnetic spectrometer surrounded by calorimeters and muon chambers. The 
charged particle tracking system is immersed in a 1.4 T solenoidal magnetic field with axis parallel 
to the beamline. A set of silicon microstrip detectors provides charged particle tracking in the radial 
range from 1.5 to 28 cm, while a 3.1 m long open-cell drift chamber, the central outer tracker (COT), 
covers the radial range from 40 to 137 cm. In combination the silicon and COT detectors provide 
excellent tracking up to about pseudorapidities [15] < 1.1, with decreasing coverage up to \ij\ < 2.0. 
Segmented electromagnetic and hadronic calorimeters surround the tracking system and measure the 
energy of interacting particles. The electromagnetic and hadronic calorimeters are lead-scintillator 
and iron-scintillator sampling devices, respectively, covering the range |?7| < 3.6. They are segmented 
in the central region (\rj\ < 1.1) in towers of 15° in azimuth and 0.1 in 77, and in the forward region 
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(1.1 < \r)\ < 3.6) in towers of 7.5° for \r]\ < 2.11 and 15° for \r]\ > 2.11. The electromagnetic 
calorimeters (l6l . Il7| are instrumented with proportional and scintillating strip detectors that measure 
the transverse profile of electromagnetic showers at a depth corresponding to the expected shower 
maximum. Drift chambers located outside the central hadronic calorimeters and behind a 60 cm iron 
shield detect muons with |ry| < 0.6 [HI]. Additional drift chambers and scintillation counters detect 
muons in the region 0.6 < |ry| < 1.5. Gas Cherenkov counters [l!| with a coverage of 3.7 < \r/\ < 4.7 
measure the average number of inelastic pp collisions and thereby determine the luminosity. 



III. MULTIJET EVENT SELECTION 



The all-hadronic final state of tt events is characterized by the presence of at least six jets from 
the decay of the two top quarks. Collisions are selected in real time using a multijet trigger, relying 
on calorimeter information, which was specially developed to collect the events used in this analysis. 
Subsequently, jets are identified off-line by grouping clusters of energy in the calorimeter using a fixed- 
cone algorithm with a radius 0.4 in r\ — & space [20} . The typical jet transverse energy [l5| resolution 
is approximately (0.1 x Et + 1-0) GeV 2l|, where Et is the jet transverse energy in GeV. After 
a preliminary selection of multijet events, a neural network selection based on relevant kinematical 
variables is used to provide the most precise cross section and mass measurements. 



A. Multijet Trigger 



The CDF trigger system has three levels. The first two levels consist of special-purpose electronics 
and the third one of conventional digital processors. For triggering purposes the calorimeter granularity 
is simplified to a 24 x 24 grid in rj — <f> space with each "trigger tower" spanning approximately 15° 
in </> and 0.2 in 7? covering one or two physical towers. At level 1, the jet trigger requires a single 
tower with transverse energy Etfi w > 10 GeV. At level 2 we require that the total transverse energy, 
summed over all the trigger towers, Yl Etf w , be > 175 GeV, and the presence of four or more clusters 
of calorimeter towers, each cluster with transverse energy E T ls > 15 GeV. In order to maintain an 
acceptable trigger rate, as the peak instantaneous luminosity increased, the ^t w threshold has 
been increased over the course of the data taking, 175 GeV being the latest value. This threshold 
has been applied, off-line, to the whole data set. Finally, the third trigger level confirms the level 2 
selection using a more accurate determination of the jet energy, requiring four or more reconstructed 
jets with Et > 10 GeV. A total of 4 340 143 events satisfy the trigger requirements, with an efficiency 
of about 58% for inclusive tt events, and of 80% for all-hadronic ti events. The efficiency has been 
estimated using Monte Carlo generated ti events, described later. This corresponds to the S/B of 
approximately 1/1100 for the theoretical cross section of 6.7 pb, where the background is represented 
by the multijet events themselves. 



B. Preselection and Topology Requirements 

Events satisfying the trigger requirements are reconstructed in terms of their final state observables 
(tracks, vertices, charged leptons, jets). We retain only those events that are well contained in detector 
acceptance, requiring the primary event vertex [22} to lie inside the luminous region (|z| < 60 cm). 
The jet energies are corrected for detector response and multiple interactions. First, we take into 
account the ^-dependence of detector response and energy loss in the uninstrumented regions. After 
a small correction for the extra energy deposited by multiple collisions in the same beam-beam bunch 
crossing, a correction for calorimeter non-linearity is applied so that the jet energies correspond, on 
average, to the energy of all the particles incident on the jet cone. All systematic uncertainties for 
the individual corrections are added in quadrature to obtain the total uncertainty on the estimate of 
the initial parton energy. This uncertainty goes from 8% to 3% with jet transverse energy increasing 
from 15 GeV to 50 GeV, and remains approximately constant at 3% above 50 GeV (23|. 
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For this analysis, each jet is required to have Et > 15 GeV and \r)\ < 2 after all corrections have been 
applied. In order to remove the events from the ti leptonic channels, we veto events containing any well 

identified high-p^ electrons and muons as defined in Q , and require that 7=^ be < 3 \/GeV @ , 

where the missing transverse energy, &t [2J] > is computed with reference to the detector origin and is 
corrected for any identified muons and the position of the pp collision point, while ^Et is obtained 
by summing the Et's of all the selected jets. At this stage, called "preselection", simulations show 
that the fraction of leptonic events amounts to about 14% of all accepted ti events. To avoid overlaps 
between jets we require jet pairs to be separated by at least 0.5 units in the 77 — <j> space. About 3.5 
million events pass these preselection requirements (S/B ~ 1/1000). Finally, we define the topology 
of the signal region by selecting events with a number of jets 6 < Aj 0ts < 8 to optimize the signal 
fraction. 506 567 events pass this additional requirement with an expected S/B of approximately 
1/370, i.e. about 0.3%; the remaining events with lower jet multiplicity have much smaller values of 
S/B and are used as control regions. The residual fraction of leptonic ti events amounts to only 5% 
of all accepted events. 

C. Neural-Network-based Kinematical Selection 

In order to further improve the S/B we use a neural network approach to recognize in more detail 
the features of signal and background events, including correlations between the kinematical variables 
which enter as input nodes in the network. We thus expect a better separation between signal and 
background relative to the former technique 0] where correlations were not fully considered. The 
network uses the mlpfit package [25] as implemented by ROOT [26| through the TMultiLayerPercep- 
tron class. We define a kinematical selection based on dynamical and topological properties of the 
candidate event. The number of variables used should allow the best possible description of the event 
properties, but, at the same time, too many input variables can worsen the performance, given the 
limited training statistics. As a guideline we studied different neural network configurations, in terms 
of inputs and hidden nodes, adding a few variables at a time, looking for the best performance in terms 
of largest S/B. The first quantities considered are those used in the total transverse energy of the 
jets, ^ Et', the quantity ^ 3 Et = Et — Et 1 — Et 2 , obtained by removing the contribution of the 
two jets with the highest Et] the centrality, defined as C = ^Ji T , where \/~§ is the invariant mass of 
the multijet system; and the aplanarity A, defined as A = §Qi, Qi being the smallest of the three 
normalized eigenvalues of the sphericity tensor, M ab = . P^Pj, calculated in the center-of-mass 
system of all jets, where the indices a and b refer to the spatial components of the jet four-momentum 
Pj. In addition, we consider the dynamical properties of dijet and trijet systems through the use of 
the minimum and maximum value of the invariant mass among all possible jet permutations: M^™ 1 ™, 
M£j ax , M™" and M^ ax . We obtain another set of discriminating variables combining the transverse 
energy of the jets with their emission direction, represented by the angle 9* between the jet direction, 
as measured in the center-of-mass frame of all jets, and the proton beam axis. The variable cos 8* has 
been shown [27j to have discriminating power against the background, and we use here the quantity 
E T = Et sin 2 0* which tends to have larger values in the signal in comparison to the background 
events; this effect is enhanced for the jets with higher Et- The variables we choose as additional 
inputs to the neural network are then Ef 1 and Ef 2 for the two highest-i/r jets, and {Elf) defined as 
the geometric mean over the remaining {Nj ots — 2) jets. The 11 variables used as inputs to the neural 
network are summarized in TableQ] Comparisons of the background-dominated data and Monte Carlo 
generated signal events for the 11 kinematical variables are shown in Figs. [1] to [5] The network is 
trained on same-size samples of signal and background events with 6 < Aj Cts < 8 (about 507000 
events). In order to model the signal we use the pythia v6.2 28] leading-order Monte Carlo generator 
with parton showering followed by a simulation of the CDF II detector. The reference top quark mass 
chosen for the training is M top = 175GeV/c 2 . The background is obtained from the multijet data 
events themselves, since the signal fraction at the initial stage is expected to be very small. Among 
the configurations investigated, the one which provides the largest expected S/B has two hidden layers 
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with 20 and 10 hidden nodes, respectively, and 1 output node. The value of the output node, N out , 
is the quantity we use as discriminator between signal and background, as shown in Fig. [5] for the 
6 < Njetg < 8 sample. 



TABLE I: Input variables to the neural network. 

Variable Description 

E*t Scalar sum of the transverse energies of all jets 

^2 3 E T As above, except the two highest- E T jets 

C Centrality 

A Aplanarity 

M™" Minimum dijet invariant mass 

]\^max Maximum dijet invariant mass 

j^jmm Mirurnum trijet invariant mass 

Mmax Maximum trijet invariant mass 

E T A E T sin 2 9* for the highest- E T jet 

E% Et sin 2 9* for the next-to-highest- St jet 

(Ej,) Geometric mean over the remaining jets 





FIG. 1: ^,Et (top) and Et (bottom) distributions in QCD multijet (solid histogram) and tt Monte Carlo 
(dashed histogram) events with 6 < 7Vj cts < 8. All histograms are normalized to unity. 



IV. TAGGING b QUARKS IN THE MULTIJET SAMPLE 

In order to estimate the tt content in the event sample, we exploit the heavy flavor content of tt events 
using a 6-tagging algorithm based on secondary vertex reconstruction as described in detail in [22l . [29| . 
The algorithm aims at the identification of jets containing a 6-hadron state by reconstructing its decay 
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FIG. 2: Aplanarity (top) and centrality (bottom) distributions in QCD multijet (solid histogram) and tt Monte 
Carlo (dashed histogram) events with 6 < A^ cts < 8. All histograms are normalized to unity. 





FIG. 3: M% in (top) and M% ax (bottom) distributions in QCD multijet (solid histogram) and tt Monte Carlo 
(dashed histogram) events with 6 < JVjets < 8. All histograms are normalized to unity. 
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M™ x (GeV/c 2 ) 

FIG. 4: My <n (top) and M™ 1 (bottom) distributions in QCD multijet (solid histogram) and ti Monte Carlo 
(dashed histogram) events with 6 < iVjets < 8. All histograms are normalized to unity. 




EV 1 (GeV) 




Ej 2 (GeV) 




<E T > (GeV) 

FIG. 5: Kinematical distributions in QCD multijet (solid histogram) and ti Monte Carlo (dashed histogram) 
events with 6 < iVjets < 8. From top: E^' 1 , E^ 2 and (E^). All histograms are normalized to unity. 
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FIG. 6: Neural network output, N out , for QCD multijet (solid histogram) and ti Monte Carlo (dashed his- 
togram) events with 6 < Njets < 8. Histograms are normalized to unity. The neural network implementation 
that we use in the TMultiLayerPerceptron produces an output which is not strictly bound between and 1. 



vertex with at least two high-quality tracks with hits in the silicon vertex detector. A 6-taggcd jet 
("tag", in brief) must have an associated secondary vertex with a displacement from the primary 
vertex in the transverse plane larger than 7.5 times the typical resolution of the vertex displacement 
of about 190 /im. Since, as shown in the next section, the background will be estimated in terms 
of inclusive tags rather than events, for the signal we do not consider the efficiency for tagging an 
event, but rather the quantity to be used in the cross section calculation is the average number of tags 
per event, n^™, which is related to the cross section using ti Monte Carlo calculations. The tagging 
efficiencies for jets coming from the fragmentation of b-, o, or light-flavored quarks are corrected 
according to the efficiency seen in the data, by a factor 0.89 ± 0.07 for b jets and 0.89 ± 0.14 for c jets 
respectively. These factors are described in detail in [22j . 

We find that the average number of tags present in a ti event after the preselection depends on the 
choice of the lower threshold, iV™™ (see Fig. [7]) for the neural network output N out . For any value of 
this threshold the systematic uncertainty on n^™ is dominated by the uncertainty of the correction 
factors for tagging b and c jets. 



V. BACKGROUND ESTIMATE 



The background for the ti multijet final state comes mainly from QCD production of heavy-quark 
pairs (bb and cc) and false tags in light-quark jets. Other standard model processes such as W/Z+jets 
have a smaller production cross section and small acceptance due to the selection cuts. Due to the 
current uncertainties on the Monte Carlo generation of QCD multijet events, we estimate the back- 
ground from the data themselves. Given the theoretical uncertainties on the production cross section 
for events generated with N partons at tree level, a more accurate background estimate is obtained 
from the data themselves rather than from theoretical prediction of cross section and simulations. 
The tag rate per jet is defined as the probability of tagging a jet whose tracks are reconstructed in the 
vertex detector. This rate is extracted from events depleted in ti signal, and is used as an estimate of 
the rate expected in events with different jet multiplicity. These depleted events, with exactly four jets 
passing the preselection ("control sample"), have S/B w 1/3600. This method intrinsically provides 
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FIG. 7: Average number of tags, rij* 



for tt Monte Carlo events passing 



the preselection and with 6 < iVjets < 8. The dashed lines represent the 1-a uncertainty on 



an inclusive estimate in terms of number of tags rather than number of tagged events. The tag rate 
per jet is evaluated in this control sample and is parameterized in terms of variables sensitive to both 
the tagging efficiency for true heavy-flavored objects and the rate of false tags. These variables are jet 
Et, the number of tracks reconstructed in the vertex detector and associated to the jet, N^, and the 
number of primary vertices in the event, -/V vert . The tag rates per jet as a function of these variables 
are shown in Fig. [8] for jets with at least two tracks within the vertex detector acceptance ("fiducial" 
jets). 

The tag rate estimates the probability that a given fiducial jet in the signal sample is tagged. 
Summing this probability over all fiducial jets, we obtain the expected number of tags from non-signal 
processes, that is QCD heavy- and light-flavored production altogether. Before the neural-network- 
based kinematical selection, the multijet sample is composed essentially of background events. The 
accuracy of our modeling of the background processes, assuming all jets to be uncorrelated, is shown 
in Figs. [H [lOl and [TT1 where we compare the N out distributions for tags in the data to the expected 
background in events with 4, 5 and 6 or more jets. The disagreement between the total number of 
observed and expected tags, possibly due to the presence of a small sample of tt events, amounts to 
no more than 0.8%. This small discrepancy observed at high jet multiplicity is accounted for as a 
systematic uncertainty on the background estimate due to the different jet multiplicity in the signal 
and control samples. The kinematical selection cutting on N out also changes the event characteristics 
with respect to those found in the sample with exactly four jets, where the parameterization has been 
derived. This selection modifies the jet-£r and T) spectra so that the average tag rate per event for 
jets from QCD background becomes higher. However, the parameterization of the tag rate in terms 
of properties of the jet (Et and N^) is shown to describe well this increase. Residual biases due 
to the neural network selection are treated as systematic uncertainties on the background prediction 
considering, in the control sample with four jets, subsets of events with N out > N™\ n with iV™™ 
in the range 0.8 — 1.0. In this case the disagreement between the total number of observed tags 
and the expected background is no more than about 2.4%, due to the neural network kinematical 
selection. The total systematic uncertainty expected on the background estimate amounts then to 
2.5%, the quadrature sum of the two uncertainties. The contributions from running conditions, such 
as instantaneous luminosity and detector configuration, have been studied and found to be negligible. 
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FIG. 8: Tag rate for fiducial jets as a function of jet Et, and iVvcrt, as measured in the control sample 
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FIG. 10: Neural network output, N out , distribution for tags in the data events with 5 jets, compared with the 
estimate from the tag rate parametrization. Events with multiple tags have multiple entries. 



VI. OPTIMIZATION OF THE KINEMATIC AL SELECTION AND ITS EFFICIENCY 

In order to obtain the most precise cross section measurement, the neural-network-based kinematical 
selection is optimized, in the sample 6 < N- iCts < 8, for the maximum signal significance for tt events, 
defined as the ratio between the expected signal (assuming the theoretical production cross section 
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FIG. 11: Neural network output, N out , distribution for tags in the data events with 6 to 8 jets, compared with 
the estimate from the tag rate parametrization. Events with multiple tags have multiple entries. 



of 6.7 pb) and the total uncertainty on the sum of signal and background, where both statistical and 
systematic uncertainties are considered. Since we have an accurate background prediction only after 
the 6-tagging requirement, the optimization refers to tagged tt events and the expected background. 
The cut on the neural network output which provides the maximum signal significance is N out > 0.94, 
as can be seen in Fig. [12] Such a selection yields, before 6-tagging, 4205 candidate events in the data 
with an efficiency of 4.8% for the signal and with S/B ~ 1/12. 

The effect of the selection on tt events and on the data is summarized in Tabic [ill The relative 
contribution from the leptonic channels after all the cuts is small, about 3%. A summary of the data 
as a function of jet multiplicity is shown in Table WU\ 



TABLE II: Efficiency of the kinematical selection measured from pythia tt Monte Carlo simulation for 
Mto P = 175 GeV/c 2 and number of events selected in the data (before requiring fc-tagging). 

Requirement Efficiency (%) Data 
Trigger 58 4340143 

Preselection 38 3480768 

6 < iVjcts < 8 20 506567 

N out > 0.94 4.8 4205 



The systematic uncertainties affecting the tt production cross section are summarized in Table IIVI 
with a total relative contribution of 17% on the kinematical selection efficiency. The uncertainty of 
16.3% arising from the jet energy scale is dominant, since this analysis requires the presence of a large 
number of jets in the event which are used to build the set of kinematical variables employed in the 
selection. This uncertainty is evaluated by the shift in signal acceptance observed by changing the jet 
energy corrections within their uncertainties. Less relevant sources of uncertainty are associated with 
different Monte Carlo hadronization schemes (1.1%), increased and decreased initial and final state 
radiation (ISR and FSR) (2.9%), and the variation of parton distribution functions (PDFs) within 
their uncertainties (1.4%). A detailed description of the procedure used to estimate these uncertainties 
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TABLE III: The number of events in data before and after the neural network selection, N out > 0.94, for 
different jet multiplicities (before requiring b-tagging). 



Jet Events before Events after 



Multiplicity 


selection 


selection 


4 


1341622 


118657 


5 


917999 


16157 


G 


372091 


2575 


7 


109295 


1069 


8 


25181 


561 



can be found in [8| . The difference in the amount of multiple beam interactions present in the data 
events and in the Monte Carlo simulation is also accounted for (2.5%). 

After the kinematical selection with N out > 0.94, the ^-tagging selects 1020 events with 1233 
candidate tags. The estimated background amounts to 937 ± 30 tags, while for ti events we expect 
an average number of tags nf™ = 0.95 ± 0.07. Since the background estimate is obtained from all the 
events passing the selection before tagging, we need to subtract the contribution due to the ti events, 
as obtained from the excess of candidates with respect to the background, divided by n^™ 30]. This 
contribution is then subtracted from the number of events before tagging to obtain, iteratively, a new 
background estimate. After this correction, the number of tags expected from background sources is 
reduced to 846 ± 37 tags, where the increased uncertainty accounts for the uncertainty on n^™- 



VII. CROSS SECTION MEASUREMENT 



In the signal region 6 < A?j e t s < 8, the excess of observed candidate tags in the data over the 
background is ascribed to ti production. A measurement of the cross section can be extracted from 
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TABLE IV: Relative systematic uncertainties on the signal efficiency, and other uncertainties related to the 
cross section. 



Source 


Uncertainty (%) 


Energy Scale 


16.3 


Parton Distribution Functions 


1.4 


Initial/Final State Radiation 


2.9 


Monte Carlo Modeling 


1.1 


Multiple interactions 


2.5 


Average number of tags 


7.4 


Estimated background 


2.5 


Integrated luminosity 
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FIG. 13: Number of candidate tags as a function of the jet multiplicity for the data after the kinematical 
selection in the signal region, compared with the expected background. The ti expectation is based on the 
measured cross section of 8.3 pb. See TablelVl 



TABLE V: Observed number of tags and expected background and signal after the kinematical selection 
N ou t > 0.94. The corrected background accounts for the presence of it events before tagging. The uncertainties 
correspond to the quadrature sum of statistical and systematic uncertainties. See Fig. 1131 



Jet Multiplicity 4 5 


6 7 8 


Background 16060 ± 575 2750 ± 92 
Corrected Background 15961 ± 677 2653 ± 112 
tt (a t t = 8.3 pb) 120 ± 20 266 ± 45 
Background + tt 16081 ± 677 2919 ± 121 


536 ± 17 255 ±8 146 ±5 
481 ± 20 223 ±10 142 ± 7 
242 ±41 101 ± 17 38 ±7 
723 ±46 324 ± 20 180 ± 10 


Data 16555 3139 


725 349 159 
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the acceptance and the background estimate: 

N bs — ^Vbkg / j\ 

tkin A "tag A '-int 

where iV b s = 1233 and A^kg = 846 ± 37 are the number of total observed and background tags, 
respectively, in the signal region 6 < N- ]cts < 8, eun = 4.8 ± 0.8% is the signal kinematical selection 
efficiency, n*™ = 0.95 ± 0.07 is the average number of tags in ti events, and C- m t = 1.02 ± 0.06 fb" 1 
is the integrated luminosity of the data sample. Given the nature of our background estimate, in 
the above formula we use tagged jets instead of tagged events. We have verified however that this 
does not introduce any bias in the cross section, but leads to a small underestimate of the statistical 
uncertainty, due to the fact that some events have two or more tags. The statistical uncertainty 
is inflated appropriately (w +20%), as determined from a set of Monte Carlo simulations where the 
number of expected signal and background events with 0, 1, 2 or more tags are fluctuated according to 
Poisson distributions. The measured value of the ti cross section is ct±t = 8.3 ± 1.0(stat.)^'°(syst.) ± 
0.5(lumi.) pb for a top quark mass of 175 GeV/c 2 . In Fig.[T3]the distribution of the number of observed 
tags and expected background is compared in the signal region to the ti signal expectation assuming 
the production cross section measured in this analysis. A good agreement is observed for all jet 
multiplicities after the kinematical selection N out > 0.94, as can be seen in Table |Vj In addition we 
show in Fig. [TJ] that the measured cross section is in agreement with the data over a wide N out range. 




VIII. MASS RECONSTRUCTION 



Having evaluated the ti production cross section in the preceding section, we proceed to measure 
the top quark mass from the same candidate events with a constrained-fitting technique. Since the 
kinematical selection described above is designed to have the best accuracy on the cross section 
measurement, we allow here for a different cut on N out in order to find the value which provides the 
best accuracy for the mass measurement. We search in simulated ti events the N™™ value which 
provides the smallest statistical uncertainty expected on the mass measurement (36j . 
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For each event we determine a reconstructed top quark mass, m[ eco , from the four- momenta of the 
six highest- Et jets. This mass is entered into a distribution which is used to determine the most likely 
top quark mass from the data sample (see Section lXlj) . 



A. Kinematical fitter 



Sixteen equations can be considered to connect the four-momenta of the two top quarks and the 
six final state particles according to the ft — > bbW + W~ — > bbqiq~2 q^qi hypothesis: 

P?=PU+P$ (2) 

$=1&-+$ (3) 

P"w- = (4) 

P"w + = ( 5 ) 



(M = 0,l,2,3) 

There are 13 unknown quantities, i.e., the unknown top quark mass and the three-momenta of the 
top quarks and of the W bosons, so the kinematics of the event is overconstrained. Only the six 
highest--&r jets are used as inputs to the kinematical fitter, resulting in 90 possible permutations of 
two jet doublets giving a W and of two jet triplets giving the top quarks. Only events with at least 
one b tag among the six highest--&r jets are used in this analysis, with the association of the tagged 
jet to a b quark, reducing the number of possible jet-to-parton permutations to 30. We construct the 
X 2 function 

2 _ {m jh -m w ) 2 (m jh - m w ) 2 {m jjh - m r t eco f 

X p2 ' p2 + p2 

1 W L W 1 t 



1 pi i—1 2 ' \°) 

1 t a i 

where rrijj 1 2 are the invariant masses of the dijet systems, rrijjj 1 2 are the invariant masses of the 
trijet systems, Tw = 2.1 GeV/c 2 is the measured natural width of the W^[3l[, and T t , fixed to 1.5 
GeV/c 2 , is the assumed natural width of the top quark [32l|. The measured jet transverse energies, 
pmeas are f ree ^ Q var y w ithin their known resolution, ai. The \ 2 is minimized with respect to the 

7 free parameters {m r t eco ', and the 6 jets transverse momenta p^\) for each of the 30 permutations 
of jets with final state partons. The permutation with the lowest x 2 is selected and a distribution 
("template") of m r t eco is then formed to be used for the determination of the true top quark mass. 

In order to reconstruct a data-driven background template we apply the kinematical fitter to the 
sample of events passing the neural network selection, but before the requirement of identified b jets. 
Within an event the fit is performed once for each fiducial jet, assuming it is a &-quark. The resulting 
value of m r t ec ° enters the template with a weight given by the tag rate associated with the fiducial jet. 
The integral of the m[ eco distribution is the sum of all weights and represents the expected number of 
background tags. This procedure does not allow a separation between the background expected for 
events with 1 or 2 tags, but treats them together. 

We follow the same approach for the data and the signal simulation, so the fit is performed for 
each association of tagged jets with one or the other of the two b quarks; events with more than 1 
tag contribute then with multiple entries. A control sample where we expect the signal to be present 
in fractions of few percent or below, is defined by 0.1 < N out < 0.8 and is further subdivided in 4 
subregions, to check the goodness of the background modeling. As shown in Figs. [12] and [TCI the 
background is found to describe well the m r t eco distributions in all the subsamples. 
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FIG. 15: Comparison of the expected versus observed ra™°. Control regions 0.1 < Nout < 0.2 (top) and 
0.2 < N out < 0.4 (bottom). The x 2 probability is indicated in the legend. 
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FIG. 16: Comparison of the expected versus observed rn[ eco . Control regions 0.4 < N ou t < 0.6 (top) and 
0.6 < Nout < 0.8 (bottom). The \ 2 probability is indicated in the legend. 
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IX. LIKELIHOOD FIT 

The technique described above gives a distribution of m T t eco (one for each tag in the event) in 
the corresponding data sample, which is a mixture of signal and background. In order to measure 
the top quark mass, we compare the m r t eco distribution from the data to the signal and background 
templates. From the templates we first derive probability density functions (p.d.f.'s) and then perform 
an unbinncd likelihood fit to determine the value of true top quark mass, M t0 p, that best describes 
the data. 



A. The Likelihood Function 

The p.d.f.'s for the m r t eco distributions are parametrized in order to have a functional form which 
varies smoothly with M top - For the signal, we use tt Monte Carlo events generated with herwig 
v6.508 with top quark masses ranging from 150 to 200 GeV/c 2 in 2.5 GeV/c 2 increments. The 
function for the signal p.d.f., P S i g (m|Mt op ), represents the probability to obtain a value m for m^ eco , 
given a true top quark mass M top in a tt event. The form used is a sum of a Gamma distribution, 
chosen to describe the reconstructed top quark mass in case of incorrect jet-parton assignments, and 
two Gaussian distributions, which model the core of the distribution. Its explicit expression is 

S 1+Sl 

P sig (m\M top ) = S 7 ■ 2 • (m - S ) Sl ■ exp(-<5 2 (m - <*„)) 

1 (1 + Ol) 



+5$ ■ —= — • exp 



/2k6a ~' r V 2<5 2 
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1 /_( m _<y B )S 



where each parameter Si is linearly dependent on M% op , 

Si = at + Pi ■ (M top - 175) (i = 0, 1, ...8) (8) 
so the total number of parameters used is 18. 

Fig-EZl shows some of the signal templates along with their parametrized p.d.f.'s. 

The background reconstructed mass is computed as described in the previous section. The integral 
of this distribution is the predicted amount of background tags, which is corrected for the presence 
of tags expected from tt events using the iterative technique described in Section IVII In order to 
account for the shape of the signal mass distribution, the correction is made by substracting from the 
background distribution the residual tt contamination distributed in mass as expected for M top = 175 
GeV/c 2 . The systematic uncertainty associated with this procedure is estimated and reported in the 
next section. The resulting background template is parametrized with two Gamma distributions and 
one Gaussian distribution. The background distribution does not depend on the top quark mass. The 
resulting p.d.f., Pbkd(m), is 

5 1+Sl 

Pbkd(m) = S s ■ , 2 r • (m - S Q ) Sl ■ exp(-S 2 (m - S Q )) 
I (1 + di) 
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FIG. 17: Four m[ eco signal templates with their p.d.f. overlaid. 
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FIG. 18: ml ec ° background template with its p.d.f. overlaid. 
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and is plotted in Fig.fTSl 



23 



The likelihood function, C, is constructed by assuming that the data are described by an admixture 
of background and tt events with a certain top quark mass. The function is obtained by multiplication 
of three terms. The first two terms constrain the number of background tags, rib, to the expectation, 
n b ' , and the number of signal and background tags, n s + rib, to be equal to the number observed 
in the data, N. In the third term the signal and background probabilities are assigned by comparing 
the measured value rtii of m r t eco from the data with the parameterized signal and background p.d.f.'s, 
P S i g and P bk d- 

C = exp — j- x exp ' 



2< ) " V 2o% 



-tt n s ■ P S ig{mi\M top ) + n b ■ Pbkdjmj) 
n s +rib 



x 



where a Ub is the expected uncertainty on the corrected background, and <7/v = v N is the expected 
uncertainty on the total number of observed tags, N. In order to facilitate the computation, we 
minimize the negative logarithm of the likelihood, —InC, instead of maximizing the likelihood itself. 
The minimization is performed with respect to the three free parameters, n s ,rib and M top . The 
statistical uncertainty on the top quark mass is taken from the points where the —InC changes by 0.5 
units from its minimum. 



B. Verification and Calibration of the Method 

We investigate for possible biases in the top quark mass measurement which can be introduced by 
our method, and quantify its statistical power before performing the actual measurement on the data 
sample. We run Monte Carlo simulations of the experiment ("pseudo-experiments"), generated with 
the true top quark mass ranging from 150 to 200 GeV/c 2 , and then extract the predicted amount 
of m[ ec ° values from the signal and background templates. We fix the total number of tags in each 
pseudo-experiment to be the same as observed in the data. Then we take our background expectation 
of tags passing the cuts, fluctuate it according to Gaussian statistics, and get the number of signal 
tags as the difference between N and the fluctuated background. We perform the measurement on 
many different sets of Monte Carlo events in the pseudo-experiment ( "pseudo-events" ) , and plot the 
fitted top quark mass with respect to the input mass in Fig. 1191 seeing no systematic bias. We define 
the "pull" of the fit variable to be the deviation of the fitted mass from the true value in the pseudo- 
experiment, divided by the measurement uncertainty determined in the fit. The pull distribution 
is fitted with a Gaussian and its width ( "pull width" ) indicates the accuracy of the uncertainties 
obtained from the fit (see Fig. l20"|) . 

The sensitivity check is performed over a range of thresholds for both N out and x 2 , to achieve 
the smallest expected statistical uncertainty. The N™™ cut is varied between 0.88 and 0.96 while 
the upper threshold, xLu varies between 30 and 6. The cuts for which we expect the smallest 
statistical uncertainty on the mass measurement are N out > 0.91 and x 2 16- The x 2 cut improves 
the background rejection so that we need to apply a threshold on N out lower than that used for the 
cross section measurement. 



X. SYSTEMATIC UNCERTAINTIES ON THE TOP QUARK MASS 

Various sources of systematic uncertainty affect the top quark mass measurement. Systematic effects 
arise from mismodeling in the simulation of the detector response to jets, and from uncertainties in 
the simulation of the tt signal. The evaluation of the mass shift due to each source of systematic 
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FIG. 19: Fitted mass as a function of the input top quark mass using pseudo-experiments. The fit slope is 
consistent with 1. The coefficients of the fitted straight line are indicated in the legend. 



£ 1.3 
| 1.2 

3 

°- 1.1 

1 

0.9 
0.8 
0.7 

150 160 170 180 190 200 

Input Mass (GeV/c 2 ) 

FIG. 20: Width of the pull distributions, for different top quark input masses. 



uncertainty consists in generating pseudo-experiments varying the effects from each possible source 
by ±lcr, building new templates out of the varied sample and determining how much the fitted top 
quark mass shifts. We take as an estimate of the respective uncertainty the resulting half-difference 
between the extreme values of the measured mass. If the shift is smaller than the statistical uncertainty 
on the estimate itself, we assign the latter as the systematic uncertainty. 

The main contribution to the systematic uncertainty stems from the residual uncertainty on the 
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jet energy after it is corrected for the known effects [23(. These include calorimeter non-linearity in 
response to single hadronic particles, energy loss into non-instrumented calorimeter regions, energy 
added to the jet by secondary interactions and the underlying event, and energy lost outside the 
jet cone. We calculate the systematic uncertainty originating from each of these sources varying 
each corrected jet energy in the simulation by the corresponding uncertainty and performing pseudo- 
experiments with the modified resulting templates, and finally adding the observed mass shifts in 
quadrature to quote a total systematic due to the jet energy scale. Since the jet energy corrections 
are derived on data samples deprived of heavy quarks, we add an additional uncertainty of the order 
of 0.6% evaluated considering the different fragmentation properties of b quarks [3] ■ As done for the 
generic jet energy scale uncertainties, we perform pseudo-experiments where we vary the 6-jet energy 
scale accordingly and use half the variation in the fitted top quark mass as the 6-jet energy scale 
uncertainty. 

Many sources of systematic effects arise from uncertainty in the Monte Carlo modeling of the 
hard interaction, pythia and herwig generators differ in their hadronization schemes and in their 
description of the underlying event and multiple interactions. A corresponding systematic uncertainty 
is evaluated drawing top quark reconstructed masses from PYTHiA-generated events and comparing the 
resulting mass distributions with the template constructed using herwig. Additional jets coming from 
initial and final state radiation (ISR and FSR) might fall among the six leading jets and populate the 
tails in the m r t eco distribution. These effects are studied using pseudo-experiments where we extract 
top quark reconstructed masses from templates generated with different values of Aqcd and scale 
factor K [l(|, and comparing them with the standard templates. Since the shift is very small, we 
assign the statistical uncertainty on the shift to be the ISR/FSR systematic uncertainty. 

The choice of parton distribution functions (PDF) inside the proton affects the kinematics of tt 
events and thus possibly the top quark mass measurement. We estimate the uncertainty from the 
difference in top quark mass resulting from the use of Monte Carlo samples based on the default 
CTEQ5L [H PDF and on the one calculated from the MRST group H, MRST72 and MRST75, 
which differ by the value of Aqcd used to compute the PDF. 

The background normalization is known to 5% from the tag rate parametrization technique. We vary 
the background contribution using pseudo-experiments where we increase or decrease the expected 
background amount by its uncertainty. 

We consider also the uncertainty associated with the small presence of signal in the data-driven 
background. To do so we build two background templates where we subtract from the background mass 
distribution the expected signal mass distribution assuming the two values 172.5 GeV/c 2 and 177.5 
GeV/c 2 for the top quark mass. We reconstruct the top quark mass from the two background templates 
and take the difference in the results as the uncertainty associated to this effect. A systematic 
uncertainty due to the finite size of the Monte Carlo samples used to determine the mass templates is 
determined by varying each of the template bin entries randomly consistent with a Poisson distribution, 
creating 100 such new templates, reparametrizing them to determine as many fitted top quark masses. 
The width of the resulting distribution is used as the systematic uncertainty. 

A bias in the measurement can arise if an inadequate functional form is used for the m r t eco templates. 
This is checked performing pseudo-experiments where we extract mass values directly from the m r t eco 
histograms and compare them with the parametrized p.d.f.'s. The average of the difference between the 
fitted top quark masses and the input masses is chosen as a systematic uncertainty on the functional 
parametrization. The 6-tagging efficiency agrees well between data and simulation; still a possible 
dependence on jet kinematical properties could lead to a shift in the measured mass. We evaluate 
here a systematic due to Et dependence of 6-tagging scale factors allowing for a slope on the Et 
dependence (consistent at la with the measurements), and determining the shift in the fitted top 
quark mass using the modified templates. Since the background estimate is data-driven, the analysis 
is sensitive to an overall uncertainty in the 6-tagging scale factor only through signal shapes. 

In Table IVT1 is shown a summary of all the systematic uncertainties; the total systematic uncertainty 
amounts to 4.8 GeV/c 2 . 
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Source 


Uncertainty (GeV/c 2 ) 


Jet energy scale 


4.5 


Generator 


1.0 


b-jet energy scale 


0.5 


Parton Distribution Function 


0.5 


Background shape 


0.5 


Background fraction 


0.5 


ISR 


0.5 


FSR 


0.5 


b-tag 


0.5 


MC statistics 


0.1 


Template parametrization 


0.1 


Total 


4.8 



TABLE VI: Breakdown of systematic uncertainties from different sources. 

XI. MASS MEASUREMENT 

After the kinematical selection with N out > 0.91, the 6-tagging requirement and the cut on the 
goodness of the fit, \ 2 < 16, we find 926 tags in 772 events. The background, corrected as in the cross 
section measurement for the contamination of it events (see Section lVl]) . amounts to 567 ± 28 tags. 

The likelihood fit is applied to the data sample to derive Mt op = 174.0 ± 2.2(stat.) ± A.8(syst.) 
GeV/c 2 . The plot in Fig.l2"T1 shows the fitted top quark mass distribution for the data compared to 
the expected background and the signal for a top quark mass of 174.0 GeV/c 2 . The plot in Fig. [22] 
compares the measured statistical uncertainty with the expected distribution from pseudo-experiments 
using as input mass M top = 174.0 GeV/c 2 . We find that the p- value for our statistical uncertainty is 
40%. As for the case of the cross section measurement, Monte Carlo simulations indicate the need to 
increase the statistical uncertainties by 5% to account for the use of multiple entries, one for each tag 
in the same event, and their correlations. 

As a last check, we perform the measurement removing from the likelihood definition the Gaussian 
term which constrains the number of background tags to be as predicted via the tag rate parameteri- 
zation, and we obtain nearly the same value, M top = 174.1 ± 2.2(stat.) GeV/c 2 . 

The measurement presented here is the most precise measurement to date of the top quark mass in 
the all-hadronic decay channel. The result is consistent with the measurements obtained in the same 
channel at y/s — 1.8 TeV [6j, with the measurement obtained at ^fs = 1.96 TeV using £j„ t = 311 
pb _1 of data with a different technique and with the results obtained in the other channels by 
CDF [TH and D0 Collaborations. 

XII. CONCLUSIONS 

Using an optimized neural-network-based kinematical selection and a 6-jet identification technique, 
we are able to improve the S/B of the initial multijet sample obtained with a dedicated trigger 
from 1 in 1100 to approximately 1 in 2. With the selected sample, we measure the tt production 
cross section to be — 8.3 ± 1.0(stat.)l 2 '5(syst.) ± 0.5(lumi.)pb assuming M top = 175GeV/c 2 , 
improving the previous measurement (37| in the same channel These results agree well with the 
standard model expectation of <7j.t = 6.7 pb for the same value of the top quark mass and with the 
results obtained in the leptonic channels. Using a slightly modified selection we reconstruct the top 
quark invariant mass for the overconstrained kinematical system and compare it to parameterized 
templates representing signal and background. A likelihood fit is used to measure a top quark mass 
of Mtop = 174.0±2.2(stat.)±4.8(syst.) GeV/c 2 , which improves the previous measurement [38] in the 
same channel Q, and agrees well with the results obtained in the leptonic channels. 
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FIG. 21: The fitted top quark mass distribution for events with N ou t > 0.91, \ 2 < 16 and at least 1 fe-tagged 
jet. Superimposed are the background and the tt signal expected for M top = 174.0 GeV/c 2 . 
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